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Abstract 

Topology of urban environments can be represented by means of graphs. 
We explore the graph representations of several compact urban patterns 
by random walks. The expected time of recurrence and the expected first 
passage time to a node scales apparently linearly in all urban patterns we 
have studied In space syntax theory, a positive relation between the local 
property of a node (qualified by connectivity or by the recurrence time) 
and the global property of the node (estimated in our approach by the 
first passage time to it) is known as intelligibility. Our approach based on 
random walks allows to extend the notion of intelligibility onto the entire 
domain of complex networks and graph theory. 

Keywords: Space syntax, random walks, recurrence times, first 
passage times 

1 Spatial networks of urban environments 

More than half of all humans are now living in cities |Ash:2008 et q/j . 
Cities are responsible for a great deal of global energy consumption 
and of greenhouse gas emissions. The global challenges of sustain- 
able development call for a quantitative theory of urban organi- 
zation. There is a well-established connection between the den- 
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sity of an urban environment, and the need to travel within it 
[Wheeler: 1998] . Good quality of itineraries is one of the necessary 
conditions for avoiding stagnation and collapse of a city. 

Studies of urban transportation networks have a long history. 
In his famous paper on the seven bridges of Konigsberg published 
in 1736, L. Euler had proven the first theorem of graph theory 
Biggs: 1986 et al\ . In Euler's solution, each urban landuse mass is 
considered as a node of a planar graph, and the bridges connect- 
ing them are the edges. Euler had found that a route travelling 
along each edge in the planar graph representation of the ancient 
Konigsberg did not exists. In the primary graph representation of 
urban transport networks originated from the work of Euler, the 
relationships between the different components of urban environ- 
ments are often measured along streets and routes considered as 
edges, while the traffic end points and street junctions are treated 
as nodes. In the last century, primary city graphs have been used ex- 
tensively in many studies devoted to the improving of transportation 
routes, the optimization of power grids, and the surveys of human 
mobility patterns. 

Another graph representation of urban transport networks is based 
on the ideas of traffic engineering and queueing theory invented by 
A.K. Erlang (see |Brockmeyer:1948 et a/| ). It arises naturally when 
we are interested in how much time a pedestrian or a vehicle would 
spend while travelling through a particular place in a city. In such a 
secondary graph representation, any space of motion is considered as 
a service station of a queuing network characterized by some time 
of service, and the relations between streets, squares, and round- 
abouts are traced through their junctions. Travellers arriving to 
a place are either moving through it immediately or queuing until 
the space becomes available. Once the place is passed through, the 
traveller is routed to its next station, which is chosen according to 
a probability distribution among all other open spaces linked to the 
given one in the urban environment. 

In general, the secondary graph representations of urban envi- 
ronments are not planar. Moreover, they are essentially similar 
to those of "dual information representation" of a city map in- 
troduced by [Rosvall:2005 et aTj and to the "dual graphs" exten- 
sively investigated within the concept of space syntax, a theory 
developed in the late 1970s, that seeks to reveal the mutual ef- 
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fects of complex spatial urban networks on society and vice versa, 
|Hillier:1984"iTaI[ IHillierTT999] . 

In space syntax theory, built environments are treated as systems 
of spaces of vision subjected to a configuration analysis. Being irrel- 
evant to the physical distances, dual graphs representing the urban 
environments are removed from the physical space. Spatial percep- 
tion shapes peoples understanding of how a place is organized and 
eventually determines the pattern of local movement, |Hillier:1 999], 
The aim of the space syntax study is to estimate the relative prox- 
imity between different locations and to associate these distances 
to the densities of human activity along the links connecting them, 
[Hansen: 19591 IWilson: 19701 Batty:2004|. The surprising accuracy of 



predictions of human behaviour in cities based on the purely topo- 
logical analysis of different urban street layouts within the space 
syntax approach attracts meticulous attention |Penn:200T] . Space 
syntax proves its usefulness for the planning and redevelopment of 
certain city districts around the world, the designing of commercial 
centres, museums, railway stations, and airports where easy way- 
finding is a significant issue. 

The decomposition of urban spatial networks into the complete 
sets of intersecting open spaces can be based on a number of different 
principles. In Jiang: 2004 et al , while identifying a street over a 



plurality of routes on a city map, the named-street approach has 
been used, in which two different arcs of the primary city network 
were assigned to the same identification number (ID) provided they 
share the same street name. The main problem of the approach is 
that the meaning of a street name could vary from one district or 
quarter to another even within the same city. For instance, some 
streets in Manhattan do not meet the continuity principle rather 
playing the role of local geographical coordinates. 

In [Porta: 2006 et a/j . an intersection continuity principle (ICN) 
has been used: two edges forming the largest convex angle in a 
junction on the city map are assigned the highest continuity and 
therefore were coupled together, acquiring the same street identifi- 
cation number (ID). The main problem with the ICN principle is 
that the streets crossing under convex angles would artificially ex- 
change their identifiers, which is not crucial for the study of the 
probability degree statistics [Porta:2006 et a/j . but makes it difficult 
to interpret the results if the dynamical modularity of the city is 
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detected |Volchenkov: 2007a et al\ . It is also important to mention 
that the number of street IDs identified within the ICN principle 
usually exceeds substantially the actual number of street names in a 
city In [Cardillo:2006 et al IScellato:2006 et al ICrucitti:2006 et al\ , 
the probability degree statistics and some centrality measures in 
different world cities have been investigated for a number of one 
square mile representative samples. However, the decision on which 
a square mile would provide an adequate representation of a city is 
always questionable. 

In |Figueiredo:2005 et al\ , the approach of [Cardillo:2006 et ah, 
IScellato:2006 et ah, ICrucitti:2006 et a\ has been improved in the 
sense that two intersecting lines of vision were aggregated into the 
same node of the dual graph representation if and only if the angle 
between the linear continuation of the first line and the second line 
was less than or equal to a predefined threshold. If more than one 
continuation was available, the line forming the smaller angle was 
chosen. 

In our paper, we take a "named-streets" -oriented point of view on 
the decomposition of urban spatial networks into the complete sets 
of intersecting open spaces following our previous works [Volchenkov:2007a et ah, 
IVolchenkov:2007b et a/j . Being interested in the statistics of random 
walks defined on spatial networks of urban patterns, we assign an 
individual street ID code to each continuous segment of a street. 
The secondary graph is then constructed by mapping all edges of 
the primary graph shared the same street ID into nodes and all in- 
tersections among each pair of edges of the primary graph into the 
edges of the secondary graph connecting the corresponding nodes. 



2 The scope of the study and results 

In the present paper, we explore the secondary graph represen- 
tations of different compact urban patterns (the Venetian channel 
network, the city of Paris, enclosed by the Peripheral Boulevard, and 
the almost regular street grid in Manhattan) by means or random 
walks. 

In the forthcoming section (Sec. [3]) we discuss the connectivity 
statistics of secondary graphs representing urban environments. In 
general, compact urban patterns have been developed under the 
deficits of physical space and therefore bear the multiple finger- 
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prints of the physical landscapes being scale dependent in general. 
However, the large urban patterns which have not been spatially re- 
stricted during their development could constitute a highly hetero- 
geneous scalable spatial networks as we demonstrate for the spatial 
network of the city of Paris bounded by the Peripheral Boulevard. 

In the Sec. HJ we explain how random walks can be used in order 
to explore complex networks. 

In Subsec. I4.1[ we demonstrate that the transition operator of a 
random walk appears naturally as the representation of the group 
of graph automorphisms in a class of stochastic matrices. Random 
walks provide us with an effective tool for the detailed structural 
analysis of connected undirected graphs exposing their symmetries. 
It is well known that while being defined on an undirected graph, 
random walks determine a unique stationary probability distribution 
for every node jLovasz:1993j . In Sec. 14.2} we show that each node of 
a connected undirected graph can be characterized with respect to 
random walks by the expected recurrence time and the expected first 
passage time. The expected recurrence time is simply the inverse of 
the stationary probability distribution of random walks for the given 
node and is therefore the local property of the node. The expected 
first passage time figures out a global relation between the node and 
other nodes of the given graph accounting for all possible random 
paths toward the node accordingly to their respective probabilities. 

In Sec. 14.31 we show that for any undirected graph, it is possible 
to define a linear self-adjoint operator and then use its nice spec- 
tral properties in order to extract the information about the graph 
structure. In particular, we demonstrate that the complete set of 
orthonormal eigenvectors of the symmetric transition operator can 
be used in order to introduce the structure of Euclidean space on 
the graph. 

In Sec. 14.41 we show that any node of the graph can be rep- 
resented as a vector in the (N — l)-dimensional vector space and 
that Euclidean distances and angles between nodes have clear prob- 
abilistic interpretations. In particular, the square of the norm of the 
vector representing a node of the graph equals to the expected first 
passage time to it by random walkers. We can conclude that random 
walks embed connected undirected graphs into (iV — l)-dimensional 
Euclidean space. 

The main result of our paper is explained in Sec. where we have 
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shown that the expected recurrence time scale apparently linear with 
the expected first passage times in compact urban environments we 
have studied. 

A similar strong positive relation between the local property of 
a place and its global properties with respect to other places in the 
dual graph of a city was known for a long time in the framework of 
spaces syntax theory |Hillier:1999j . Our approach based on inves- 
tigation of complex networks by means of random walks allows us 
to extend the notion of intelligibility far beyond space syntax onto 
the entire domains of complex networks and the general theory of 
graphs. 



3 Connectivity statistics in secondary graphs rep- 
resenting urban environments 

The degree of a node representing a place in the secondary graph 
representation of an urban environment is the number of locations 
directly adjacent to the given one in the city. In space syntax theory, 
the degree of a node (i.e. connectivity) is considered as a local 
characteristic quantifying how well the space is connected to others 
in the urban pattern, |Klarqvist:19 93|. 

The probability degree distribution, 

P(k) = Pr [i e G|deg(i) = k] , (1) 

suggests that a node selected uniformly at random has a certain 
degree k with the probability P(k). The probability degree dis- 
tribution is an important concept characterizing the topology of 
complex networks. It originates from the early studies of random 
graphs by Erdos and Renyi, |Erdos:1959 et a\ et al as a common 
way to classify large graphs into categories such as random graphs 
[Er dos: 1959 et a\ et al and scale-free networks |Barabasi:2004j . 

It has been reported earlier by |Jiang:2004 et a/[IVolchenkov:2007a et al\ 



that the secondary graphs representing the urban environments un- 
der the street-name identification principle exhibit the small-world 
property |Newman:2001 et~al\ . but the scale-free probability degree 
distributions pertinent to the scale-free graphs can hardly be rec- 
ognized. In general, compact city patterns do not provide us with 
sufficient data to conclude on the universality of degree statistics. 
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To give an example, we display in Fig. [T] the log-log plot of the 
numbers of Venetian channels vs. the numbers of their junctions k. 
The solid line indicates the cumulative empirical probability degree 
distribution, N<#(k) = Y^k'=k N(k')/N, where N = 96 is the to- 
tal number of Venetian channels (including those in the Giudecca 
island), and N(k') is the number of channels crossing precisely k' 
other channels. It is remarkable that the empirical probability de- 




In k 

Figure 1: The log- log plot displays the numbers of Venetian channels N(k) versus 
the numbers k of their junctions. The solid line indicates the cumulative probability 
degree distribution, N^}(k). 

gree distributions observed for the secondary graphs are usually 
broad indicating that the itineraries can cross different numbers of 
other routes. Nevertheless, the distributions usually have a clearly 
recognizable maximum corresponding to the most probable num- 
ber of junctions an average transport route has in the city. The 
distributions usually have a long right tail that decays faster then 
any power law due to just a few routes that cross many more oth- 
ers than in average, [Volchenkov :2007a et a/j . This conclusion has 
been recently supported by |Figueiredo:2007 et al\ where it has been 
suggested that in general the probability degree distributions of sec- 
ondary graphs are scale-dependent. 

It is important to note that in the relatively large secondary 
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graphs which may contain many thousands of nodes a power law tail 
can be observed in the probability degree distributions. In Fig. [2j 
we have sketched the log-log plot of the numbers of open spaces 
in the secondary graph of Paris (consisting of 5131 interconnected 
open spaces enclosed by the Peripheral Boulevard) versus the num- 
bers of their junctions with others. The spatial network of Paris 
forms a highly heterogeneous apparently scalable graph. In urban 
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Figure 2: The log- log plot represents the empirical degree statistics for the secondary 
graph of Paris (5131 interconnected open spaces enclosed by the Peripheral Boulevard) . 
N(k) is the number of locations adjacent to exactly k other locations on the map of 
Paris. The slope of the regression line equals 2.425. 

studies, scaling and universality have been found with a remarkable 
regularity. The evolution of social and economic life in cities in- 
creases with the population size: wages, income, growth domestic 
product, bank deposits, as well as the rate of innovations, measured 
by the number of new patents and employment in creative sectors 
scale super-linearly, over different years and nations, with statisti- 
cally consistent exponents |Florida:2004l IBettencourt:2007 et al\ . 

The probable reason for the similarity is that highly complex, 
self-sustaining structures, whether cells, organisms, or cities con- 
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stitute of an immense number of units are organized in a form of 
self-similar hierarchical branching networks, which grow with the 
size of the organism |Enquist:1998 et aj A generic social dynamic 
underlying the scaling phenomena observed in cities implies that 
an increase in productive social opportunities, both in number and 
quality, leads to quantifiable changes in individual behaviours of 
inhabitants integrating them into a complex dynamical network 
|Macionis:1998 et al\ . 

The famous rank-size distribution of city sizes over many coun- 
tries is known as Zipf's Law |Zipf:19 49|. If we calculate the natural 



logarithm of the city rank in some countries and of the city size (mea- 
sured in terms of its population) and then plot the resulting data in 
a diagram, we obtain a remarkable linear pattern with the slope of 
the line equals —1 (or +1, if cities have been ranked in the ascend- 
ing order), [Soo:2002] . The similar centrality-rank distributions for 
the values of a space syntax measure quantifying the centrality of 
nodes in the secondary graphs of compact urban patterns have been 
recently reported by us in |Volchenkov: 2007b et aJ\ . 



4 Exploration of graphs by random walks 

A graph naturally arises as the outcome of a categorization, when 
we abstract any real world system by eliminating all but one of its 
features and by grouping things (or places) sharing a common at- 
tribute by classes or categories. For instance, the common attribute 
of all open spaces in a city is that we can move through them. All 
open spaces found in a city are considered as physically identical, 
so that we can regard them as nodes of a secondary graph G(V, E), 
in which V is the set of all such spaces, and E is the set of all in- 
terconnections between them. For each graph G(V,E), there exists 
a unique, up to permutations of rows and columns, adjacency ma- 
trix A. In the special case of a finite simple graph (an undirected 
graph with no self-loops), the adjacency matrix is a {0, l}-matrix 
such that Aij — 1 if i ^ j, i ~ j, and A^ = otherwise. The degree 
of a node i G V is therefore given by 

ki = A^. (2) 
For weighted undirected graphs, the adjacency matrix A is replaced 
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by a symmetric positive affinity matrix w. 

4.1 Why random walks? 

The set of graph automorphisms Aut(G), the mappings of the 
graph to itself which preserve all of its structure, is specified by 
the symmetric group Sat that includes all admissible permutations 
II G §tv taking the node i G V to some other node EC(i) G V. The 
representation of Sat consists of all N x N matrices Iln, such that 

(Hn),- nW = !> and ( n n) M = if U{i). 

A linear transformation of the adjacency matrix 

N 

Z = •F'ijalAgi, J-'ijsi G K (3) 

8,1=1 

belongs to Aut(G) if 

nJz(A) n n = z(nlAu u ), (4) 

for any IT G S;y. It is clear that the relation is satisfied if the 
entries of the tensor T in meet the following symmetry property: 

•^ri(i) n(j) n(s) n(0 = -^jsi) (5) 

for any II G Sjy- Since the action of §at preserves the conjugate 
classes of index partition structures, it follows that any appropriate 
tensor T satisfying (jS]) can be expressed as a linear combination of 
the following tensors: {1, 8 is , 8 U , 5 js , 5 jh 8 sl , 8^6^, 5 js 5 sl , 
SsiSu, SuSij, 6ij6 8 i, 5 is 5ji, Sudjs, 8^8x6^} . By substituting the above ten- 
sors into ([3]) and taking account on the symmetries, we obtain that 
any arbitrary linear permutation invariant function Z (A) defined on 
a simple undirected graph G(V, E) must be of the following form, 

Z (A) 4j = oi + 8ij (a 2 + a 3 kj) + a 4 Aij, (6) 

where kj = deg G (j), and 01,2,3,4 being arbitrary constants. 

If we impose, in addition, that the linear function Z preserves 
the connectivity, 

h = Z (A) tf , (7) 

it follows that ai = a 2 = (since the contributions of a±N and a 2 are 
indeed incompatible with (j7j)) and the remaining constants should 
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satisfy the relation 1 By introducing the new parameter 

f3 = 04 > 0, we can reformulate ([6]) in the following form, 

Z{K\. = {l-fiSijkj + PAq. (8) 

It is important to note that ([7]) can be interpreted as a probability 
conservation relation, 

1 = rE z ^' V?G ^ ( 9 ) 

and therefore the linear function Z (A) can be interpreted as a 
stochastic process. 

By substituting (jSJ) into (jHJ), we obtain 

V T (/3) l iU ^ 

in which the operator is nothing else but the generalized ran- 
dom walk transition operator for (3 G [0, 1]. The operator 
defines "lazy" random walks for which a random walker stays in 
the initial vertex with probability I — (3, while it moves to another 
node randomly chosen among the nearest neighbors with proba- 
bility /3/ki. In particular, for (3=1, the operator 7^ describes 
the standard random walks extensively studied in classical surveys 
|Lovasz:1993j . [ATdous et a/j . 



4.2 Recurrence and first passage times 

Simple discrete time random walks are the stochastic processes 
where the position of a walker at time t depends only on its position 
at time t—1. The attractiveness of random walks methods in relies 
on the fact that the distribution of the current node of any undi- 
rected non-bipartite graph after t 3> 1 steps tends to a well-defined 
stationary distribution tt, which is uniform if the graph is regular. 

Since the matrix T^- for any (3 G [0, 1] is a real positive stochastic 
matrix, it follows from the Perron- Frobenius theorem [Horn: 1990 et a/] 
that its maximal eigenvalue is simple and equals 1. The left eigen- 
vector, irT(P) = 7r, associated with the maximal eigenvalue 1 is 
positive, 

n = (ii) 
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and satisfies the normalization condition ^2 i€V 7Tj = 1 independently 
of (3 G [0, 1]. The left eigenvector it is interpreted as the unique sta- 
tionary distribution of this random walk. If the graph G is not 
bipartite, any density function a (<Tj > 0, J2i£V ai = -0 asymptoti- 
cally tends to the stationary distribution it under the actions of the 
transition operator 



7T 



lim a {T^Y . (12) 



t— »oo 

Let us consider a random walk 

Irfxo {-^0) X±, ■ ■ ■ X n i, . . . Xn+m i, ■ ■ ■} 

starting from some node Xq G V chosen randomly among all nodes 
of the undirected graph G. It is clear from (1111) that for long enough 
random walks the probability to find a random walker in a certain 
node i G V equals 7^ that is proportional to the degree of node k-i. 
The expected recurrence time to i G V is given by rhi = tc^ 1 , and 
therefore depends on the local property of the node (its degree). 

The first passage time fii is the expected number of steps required 
for the random walker to reach the node i for the first time start- 
ing from a node randomly chosen among all nodes of the graph G. 
This characteristic time is calculated as an average over all random 
paths toward the node taken into account in accordance with their 
respective probabilities. Being the global characteristic of the node, 
fii estimates the level of accessibility to the node from the rest of 
the graph. 

Let us now calculate the first passage time to a node by using 
spectral analysis of a self-adjoint transition operator. 

Probably, Lagrange was the first scientist who investigated a 
simple dynamical process (diffusion) in order to study the prop- 
erties of a graph, |Lagrange:18 67|. He calculated the spectrum 
of the Laplace operator defined on a chain (a linear graph) of iV 
nodes in order to study the discretization of the acoustic equations. 
The idea of using the spectral properties of self-adjoint operators 
in order to extract information about graphs is standard in spectral 
graph theory |Chung:1997 and in theory of random walks on graphs 



|Lovasz:1993j . [Aldous et a\ . In the following calculations, we take 
f3 — 1 in the transition operator ffTUl) . This choice allows us to com- 
pare our results directly with those known from the classical surveys 
on random walks [Lovasz:1993j . |Aldous et a/j . 
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4.3 Euclidean embedding of graphs by random walks 

The stationary distribution of random walks tt defines a unique 
measure on the set of nodes V with respect to which the transition 
operator ( (TTDjl for (3 = 1) is self-adjoint, 

f=- (tt^Ttt- 1 / 2 + n-^n 1 / 2 ) , (13) 

where T T is the adjoint operator, and 7r is defined as the diagonal 
matrix diag (tTi, . . . , ir N ). In particular, 

% = -^=, (14) 

when G is a simple undirected unweighted graph. The ordered set 
of real eigenvectors {ipi}f =l of the symmetric transition operator T 
forms an orthonormal basis in Hilbert space TC(V). The components 
of the first eigenvector ipi belonging to the largest eigenvalue H\ — 1, 

ipi T = Vi, V'l.i = ?Ti, (15) 

describes the connectivity of nodes. The Euclidean norm in the 
orthogonal complement of ipi , 

N 
s=2 

, gives the probability that a random walker is not in i. The eigen- 
vectors, {ip s }^ =2 , belonging to the ordered eigenvalues 1 > /i2 > 
■ ■ ■ fJ'N — — 1 describe the connectedness of the entire graph G. The 
orthonormal system of functions ip s is useful for decomposing nor- 
malized functions defined on V. 

The symmetric transition operator T projects any density a G 
TC{V) on the eigenvector ipi related to the stationary distribution tt, 
oT — ipi + cr±T, (T± — (7 — ip\i i n which a± is the vector belonging 
to the orthogonal complement of ipi characterizing the transient 
process toward the stationary distribution tc induced by a. 

Given two different densities a, p G H, it is clear that with respect 
to random walks they differ only on their transient parts, but not 
on the final stationary state tt. Therefore, we can compare any 
two densities defined on V by means of random walks. Since all 
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components ipi^ > 0, it is convenient to rescale the densities by 
dividing their components by ipi,i, 

1pl,i y/Ki y/Th 



such that for example aT = 1 + a±T. Then we define the squared 
Euclidean distance between any two densities with respect to ran- 
dom walks by the sum over all times t > 0, 

11 ° - p Ht = Yl ( ?± - ^ ?± ~ p-l ) ' ( 17 ) 

t >0 

where we have used Dirac's bra-ket notations especially convenient 
for working with inner products and rank-one operators in Hilbert 
space. In order to perform the summation over time in fflTj) . it is 
convenient to use the spectral representation of T, 

n 1- 

w-p\\t 

= Et> Ef= 2 pI {° x - p^s){^ x - p l ) (18) 



E 



N ( &±-p± I )( I o-±-p± ) 
s=2 1- Ms 



We conclude the description of the (N — l)-dimensional Euclidean 
space structure induced by random walks defined on G by men- 
tioning that every density a G TC(V) can be characterised by the 
(N — l)-dimensional vector with the norm defined by 



,2 \^ ( 0± I ) ( V's I 0± ) 



s=2 



Moreover, given two densities a,p G H(V), we can introduce a 
scalar product in the (N — l)-dimensional Euclidean space by 



N 



(a,p) T = 2_^ r^— , (20) 

s =2 1 P s 

so that the angle between a, p G 7Y(V) can be calculated as 

cos Z(p,a)= „ ( ,^, )t i , ■ (21) 
II 0" IIt II P llr 
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4.4 First passage time as the Euclidean norm of a node 

Random walks embed connected undirected graphs into the Eu- 
clidean space M^ -1 . This embedding can be used directly in order 
to calculate the first passage times to individual nodes. 

Indeed, let us consider the vector ej = {0, . . . lj, . . . 0} that rep- 
resents the node i G V in the canonical basis as a density function. 
In accordance to (j!9p . the vector e^ has the squared norm of e, 
associated to random walks is 

IM^-Et^. (22) 

It is important to note that in the theory of random walks |Lovasz:1 993j 
the r.h.s. of (1221) is known as the spectral representation of the first 
passage time to the node i G V from a node randomly chosen 
among all nodes of the graph accordingly to the stationary distri- 
bution 7T. The first passage time, nj = ||ej||y, can be directly used 
in order to characterize the level of accessibility of the node i. 

The Euclidean distance between any two nodes of the graph G 
calculated in the (N — 1)— dimensional Euclidean space associated 
to random walks, 




also gets a clear probabilistic interpretation as the spectral represen- 
tation of the commute time, the expected number of steps required 
for a random walker starting at i G V to visit j G V and then to 
return back to i, |Lovasz:1993j . 

The commute time can be represented as a sum, Kij = Hij+Hj i, 
in which 

II ®* IIt ( ®*> )t (24) 
is the first hitting time which quantifies the expected number of 
steps a random walker starting from the node i needs to reach j for 
the first time, |Lovasz:19 93j. 

The scalar product (ej,ej) T estimates the expected overlap of 
random paths towards the nodes % and j starting from a node ran- 
domly chosen in accordance with the stationary distribution of ran- 
dom walks 7i. The normalized expected overlap of random paths 



15 



given by the cosine of an angle calculated in the (N—l)— dimensional 
Euclidean space associated to random walks has the structure of 
Pearson's coefficient of linear correlations that reveals it's natural 
statistical interpretation. If the cosine of an angle (121 j) is close to 1 
(zero angles), it indicates that the expected random paths toward 
both nodes are mostly identical. The value of cosine is close to -1 
if the walkers share the same random paths but in the opposite di- 
rection. Finally, the correlation coefficient equals if the expected 
random paths toward the nodes do not overlap. It is important to 
mention that as usual the correlation between nodes does not nec- 
essary imply a direct causal relationship (an immediate connection) 
between them. 

5 The first passage time and intelligibility of 
complex urban networks 

It is intuitive that the time of recurrence to a node, rhi, has to be 
positively related to the first passage time to it, nf the faster a ran- 
dom walker hits the node for the first time, the more often he is ex- 
pected to visit it in future. This intuition is supported by (|22j) from 
which it follows that n,i oc m; provided the sum X^=2^si/ 0- ~ 
is uniformly independent of the connectivity for all nodes. The pos- 
sible relation between the local and global properties of nodes is the 
most profound feature of a complex network. It is interesting to 
note that this nontrivial property of eigenvectors seems to be true 
for the secondary graphs representing complex urban networks: the 
first passage times to the nodes scale apparently linearly with their 
connectivity. 



^50 50 

First passage times from Broadway 



Figure 3: The 2-dimensional projection of the Euclidean space of 355 locations in 
Manhattan (New York) from Broadway set up by random walks. 

In Fig. [31 we have sketched the 2-dimensional projection of the 
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Euclidean space of 355 locations in Manhattan (New York) set up 
by random walks. Nodes of the secondary graph are shown by disks 
with radiuses taken proportional to the connectivity of the places. 
Broadway, a wide avenue in Manhattan which also runs into the 
Bronx and Westchester County, possesses the highest connectivity 
and located at the centre of the graph shown in Fig. [3J Other places 
are located at their Euclidean distances from Broadway calculated 
accordingly (123]) . and (J2T]) has been used in order to compute angles 
between Broadway and other places. 

A part-whole relationship between local and global properties of 
the spaces of motion is known in space syntax theory as an intel- 
ligibility property of urban pattern |Hillier:1984 et a/] . |Hillier:1999j . 
The adequate level of intelligibility is proven to be a key deter- 
minant of the human behaviour in urban environments encourag- 
ing peoples way- finding abilities, Jiang: 2004 et a/| . In space syntax 
theory, the local property of an open space is qualified by its con- 
nectivity, while its global property is estimated by a special space 
syntax measure called 'integration'. In the traditional space syntax 
analysis [Hillier: 1999] . the integration of a place into urban environ- 
ments is estimated by the normalized sum of all graph theoretical 
distances toward the place from all other places in the city. In 
|Jiang:2004 et al\ , the integration of a place has been estimated by 
means of the centrality of the node in the dual graph representation 
of the urban environment. 

The first passage time to a node which we use in the present paper 
in order to quantify the relation of the node with other nodes in the 
graph has an immediate connection to neither the traditional space 
syntax integration measure discussed in [ Hilher:1984 et a^ . [Hilrier7 l999j 
nor the centrality measure investigated in |Jiang :2004 et afl. How- 
ever, the first passage time indicates a strong positive relation be- 
tween the local and global properties of the spaces of motion in 
urban environments (see Fig H]) in a pretty same way as it has been 
demonstrated in the classical space syntax analysis. 

The approach based on investigation of complex networks by 
means of random walks allows us to extended the notion of intel- 
ligibility far beyond the urban studies where it has been originally 
invented onto the entire domains of complex networks and the gen- 
eral theory of graphs. 
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Figure 4: The times of recurrence to locations in Manhattan scale linearly with the 
first passage times to them. 
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